Statistical analysis of nucleotide sequences.
نویسندگان
چکیده
In order to scan nucleic acid databases for potentially relevant but as yet unknown signals, we have developed an improved statistical model for pattern analysis of nucleic acid sequences by modifying previous methods based on Markov chains. We demonstrate the importance of selecting the appropriate parameters in order for the method to function at all. The model allows the simultaneous analysis of several short sequences with unequal base frequencies and Markov order k not equal to 0 as is usually the case in databases. As a test of these modifications, we show that in E. coli sequences there is a bias against palindromic hexamers which correspond to known restriction enzyme recognition sites.
منابع مشابه
Intraspecies Gene Variation within Putative Epitopes of Immunodominant Protein P48 of Mycoplasma agalactiae
P48 protein of Mycoplasma agalactiae is used to diagnose infection and was identified as potential vaccine candidate. According to the genetic nature of mycoplasma and variable sensitivity in P48-based serological diagnosis tests, intra species variation of P48 nucleotide sequence investigated in 13 field isolates of difference province of Iran along with three vaccine strains. Samples were col...
متن کاملComparative genomics of human stem cell factor (SCF)
Stem cell factor (SCF) is a critical protein with key roles in the cell such as hematopoiesis, gametogenesis and melanogenesis. In the present study a comparative analysis on nucleotide sequences of SCF was performed in Humanoids using bioinformatics tools including NCBI-BLAST, MEGA6, and JBrowse. Our analysis of nucleotide sequences to find closely evolved organisms with high similarity by NCB...
متن کاملPhylogenetic analysis and genetic variation of Tomato yellow leaf curl virus based on the V1 gene in Iraq
Tomato yellow leaf curl virus (TYLCV) is a supreme pathogen in tropical and subtropical areas. During 2014-2015, a total of 393 tomato samples showing Tomato yellow leaf curl disease (TYLCD) symptoms were collected from six different provinces of Iraq. In serological assays, 55 out of 393 samples (14%) reacted positively with TYLCV-specific antibodies .The presence of TYLCV was verified in 21 (...
متن کاملThe Major Sources of Genetic Differentiation Among Apricot Latent Virus (ApLV) Isolates
Background and Aims: Apricot latent virus (ApLV) is a species within Foveavirus genus (Betaflexiviridae family, Tymovirales order). Phylogenetic analyses using different ORFs nucleotide sequences divided most ApLV isolates into two clusters. However, there is little data about the sources of genetic differentiation among ApLV isolates. Materials and Methods: Partial coat protein (CP) sequences...
متن کاملPhylogenetic Analysis of Three Long Non-coding RNA Genes: AK082072, AK043754 and AK082467
Now, it is clear that protein is just one of the most functional products produced by the eukaryotic genome. Indeed, a major part of the human genome is transcribed to non-coding sequences than to the coding sequence of the protein. In this study, we selected three long non-coding RNAs namely AK082072, AK043754 and AK082467 which show brain expression and local region conservation among vertebr...
متن کاملComparison of Phylogenetic and Evolutionary of Nucleotide Squences of HVR1 region of Mitochondria genom in Goats and Other Livestock Species
Maintaining genomic diversity in goat populations in different parts of Iran is essential for breeding programs, increasing production, survival, resistance to diseases, and various environmental changing conditions. The aim of the present study was to determine the sequence of HVR1 from the mitochondrial genome of Iranian native goats including Sistani, Pakistani, Black and Lorry ecotypes...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Nucleic acids research
دوره 18 22 شماره
صفحات -
تاریخ انتشار 1990